# Scene Parsing
Segformer B0 Scene Parse 150
Other
Lightweight image segmentation model based on MIT-B0 architecture, optimized for scene parsing tasks
Image Segmentation
Transformers

S
univers1123
20
0
Upernet Swin Small
MIT
UperNet is a framework for semantic segmentation, utilizing Swin Transformer as the backbone network to achieve pixel-level semantic label prediction.
Image Segmentation
Transformers English

U
openmmlab
1,467
5
Upernet Convnext Large
MIT
UperNet is a semantic segmentation framework combined with the ConvNeXt large backbone network for pixel-level semantic label prediction.
Image Segmentation
Transformers English

U
openmmlab
23.09k
0
Upernet Convnext Small
MIT
UperNet is a framework for semantic segmentation that uses ConvNeXt as its backbone network, enabling pixel-level semantic label prediction.
Image Segmentation
Transformers English

U
openmmlab
43.31k
31
Smallcap7m
A model capable of converting image content into textual descriptions, suitable for various vision-language tasks.
Image-to-Text
Transformers English

S
Yova
977
5
Segformer B2 Finetuned Ade 512 512
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20k dataset, suitable for image segmentation tasks at 512x512 resolution.
Image Segmentation
Transformers

S
nvidia
44.07k
3
Segformer B5 Finetuned Ade 640 640
Other
SegFormer is a Transformer-based semantic segmentation model fine-tuned on the ADE20k dataset, suitable for image segmentation tasks.
Image Segmentation
Transformers

S
nvidia
42.32k
39
Maskformer Swin Large Ade
Other
Semantic segmentation model trained on the ADE20k dataset, using a unified framework for instance segmentation, semantic segmentation, and panoptic segmentation tasks
Image Segmentation
Transformers

M
facebook
4,708
57
Featured Recommended AI Models